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Abstract. This paper discusses the topic of the minimum width of a regular resolution 
refutation of a set of clauses. The main result shows that there are examples having small 
regular resolution refutations, for which any regular refutation must contain a large clause. 
This forms a contrast with corresponding results for general resolution refutations. 



Introduction 

Recent results [H [13] showing near-exponential separations between the size of regular and 
general refutations of certain sets of clauses also show a separation of general and regular 
resolution width. That is to say, the examples used in showing the size separation have 
large regular resolution width, but bounded general resolution width. 

This observation suggests that it might be possible to prove results for regular resolution 
similar to those of Ben-Sasson and Wigderson [5] for tree resolution and general resolution. 
The main theorem below shows that this hope is bound to be disappointed; it exhibits 
examples having small regular resolution size, but large regular width. 

The first part of the paper gives a characterization of regular resolution width, in the 
style of Atserias and Dalmau [3J. The second part discusses the relationship between the 
size and width of regular resolution refutations. 

1. Resolution proofs and their width 

A literal is a propositional variable x or its negation —>x. A clause is a set of literals, 
interpreted as the disjunction of the set. For clauses containing exactly one positive literal, 
we use the implication px,---,Pk — ► 1 as alternative notation for the clause -ipiV- • • V-ij>fc Vg. 
For notational convenience, we shall also allow the case where the positive literal q is replaced 
by the propositional constant _L. For any assignment a, cr(_L) = 0, so that the expression 
u pi, . . . ,pk — > -L" is an alternative notation for the purely negative clause ->pi V • • • V ->pk 
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If p is a variable, and C a clause, then we say that p has a positive occurrence in C if 
p is in C , and a negative occurrence in C if —>p is in C. In addition, we shall say that _L 
has a positive occurrence in the purely negative Horn clause p±, . . . , p^ — > _L. If E is a set 
of clauses, and x,y are variables in E, or the propositional constant _L, then we say that 
there is an implicational chain from x to y in E if there is a sequence x = xq, . . . , xp. = y 
of variables (or constants) and a sequence C±, . . . , of clauses so that for all i, < i < k, 
Xi-i occurs negatively and Xi positively in Cj. 

The resolution rule allows us to derive the resolvent CVD from the clauses C V x and 
D V -ix by resolving on the variable x. A resolution derivation of a clause C from a set of 
clauses X consists of a sequence of clauses in which each clause is either a clause of E, or 
derived from earlier clauses by resolution, and C is the last clause in the sequence; it is a 
refutation of E if C is the empty clause A. 

The size of a resolution proof is the number of occurrences of clauses in the proof - 
that is to say, the length of the proof considered as a sequence. For a contradictory set of 
clauses X, we write S*(E) for the minimum size of a resolution refutation of E. A resolution 
proof is a tree-style proof if every clause in the proof is used at most once as a premiss in 
a resolution inference. We write 5"t(E) for the minimum size of a tree-style refutation of a 
contradictory set of clauses E. A resolution refutation of a set of clauses E is an input proof 
if in every application of the resolution rule in it, at least one premiss of the application is 
an input clause in E. Every input refutation is automatically a tree-style refutation. 

An irregularity in a resolution proof is a sequence of clauses Ci,. . . ,C& so that Ci + \ 
is derived from Cj (that is, Ci is one of the premisses of a resolution inference in which 
the conclusion is CVfi), and there is a variable that occurs in C\ and Ck, but not in any 
intermediate clause Cj, 1 < j < k. A resolution proof is regular if it contains no irregularity. 

If V is a set of propositional variables, then an assignment is a Boolean function defined 
on a subset of V, that is, an assignment of {0, 1} to some or all of the variables in V . If 
a is an assignment, then we write \a\ for the cardinality of a, the number of variables to 
which a assigns values. The result of restricting a clause C by setting a literal I is defined 
as follows. If the literal I occurs in C , then C[l := 1] = 1, while C[l := 0] is C \ {/}. If E is 
a set of clauses, and v 6 {0, 1}, then E|7 := v] is the set of clauses {C[l := v] : C G E} \ {1}. 

The width of a clause is the number of literals in it. The width w (E) of a set of clauses is 
the maximum width of a clause in E, while the width of a resolution proof is the maximum 
width of a clause in it. If E is a contradictory set of clauses, then we define the refutation 
width of E, written u;(E h 0), to be the minimum width of a resolution refutation of E. 
If J 7 is a family of resolution proofs, we define a restricted notion of refutation width, the 
J 7 -refutation width, w(S 0), to be the minimum width of a refutation of E that belongs 
to T. In particular, we define the regular refutation width, w(E 0), of a contradictory 
set of clauses E to be the 7£-refutation width where 1Z is the class of all regular resolution 
proofs. 

The notation logx stands for the base two logarithm of x, and log fc x for (logx) fc . 

2. Characterization of general resolution width 

In this section, we give a proof of a result of Atserias and Dalmau [3] characterizing the 
width of general resolution refutations. The characterization is in terms of a two player 
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game, that we shall call the k-width game, played by the Prover and the Adversary. The 
rules of the game are as follows. 

The players are given a set of clauses £, on a set V of variables, and an integer parameter 
k > 0. The players together construct a succession of assignments to the variables in V. 
Initially, the assignment is empty. Each round of the game proceeds as follows, starting from 
a current assignment. First, the Prover queries an unassigned variable, and the Adversary 
assigns a value to it. Second, the Prover is allowed to delete some of the values of the 
variables in V from the assignment resulting from the Adversary's reply; the result is the 
new current assignment. 

The Adversary can win in two ways. First, if the current assignment (after deletions) 
assigns values to more than k variables; second, if an earlier assignment is repeated during 
the play of the game. The Prover wins if the current assignment falsifies an initial clause 
in E. Clearly every play of the game must eventually terminate with a win for the Prover 
or for the Adversary (Atserias and Dalmau define their game so that when the Adversary 
wins, the game can continue infinitely). 

Definition 2.1. If E is a set of clauses on a set V of variables, then a non-empty family A 
of ^/-assignments is an extendible k-family for £ if it satisfies the following conditions: 

(1) No assignment in A falsifies a clause in £; 

(2) Each assignment a in A satisfies the condition \a\ < k; 

(3) If a G A, and (3 Q a, then f3 G A; 

(4) If a G A, \a\ < k, and x G V, then there is a (3 G A, so that a. C /3, and j3{x) is defined. 

The next theorem shows that a resolution refutation of width k constitutes a winning 
strategy for the Prover, while an extendible k + 1-family provides a winning strategy for 
the Adversary. 

Theorem 2.2. [Atserias and Dalmau 2003] Let £ be a set of clauses, and k > u>(£). 
Then the following are equivalent: 

(1) There is no resolution refutation ofT, of width k; 

(2) There is an extendible k + 1-family for £; 

(3) The Adversary wins the k + 1-width game based on £. 

Proof. First, let us suppose that there is no resolution refutation of E of width k. Let C be 
the set of all clauses having a resolution proof from E of width at most k; since u>(£) < k, 
SCC. Let A be the set of all assignments of size at most k + 1 that do not falsify any clause 
in C. We claim that A is an extendible k + 1-family for E. First, A is non-empty, because 
it contains the empty assignment (since C does not contain the empty clause). Second, A 
satisfies the first three conditions of Definition I2.1|. by construction. To prove the fourth 
condition, let a G A, and \a\ < k, x G V , but there is no extension (3 of a in A with f3(x) 
defined. It follows that there is a clause D G C that is falsified if we extend a by setting x 
to 0. Then D = E V x for some E, since otherwise a would falsify D. Similarly, there is a 
clause F V —<x in C that is falsified by the extension of a that sets x to 1. But then a must 
falsify E\/ F, showing that EV F has width at most k, since \a\ < k. Hence, it follows that 
E V F is in C, contradicting our assumption that a is in A. 

Second, let us suppose that there is an extendible 1-family for E. Then the Adversary 
can play the /c-width game on E by responding to the Prover's queries with the appropriate 



Atserias and Dalmau, following the tradition of finite model theory, call their players the Spoiler and 
the Duplicator, but our terminology seems clearer in the present context. 



4 



A. URQUHART 



assignment from the family, starting with the empty assignment. Since no assignment in the 
family falsifies an initial clause, this strategy must eventually end in a win for the Adversary, 
no matter how the Prover plays. 

Finally, let us suppose that there is a resolution refutation of E of width k. Then the 
refutation provides the Prover with a winning strategy in the k + 1-width game based on 
E. Starting from the empty clause at the root, the Prover follows a path in the refutation 
so that at each round, the assignment (after appropriate deletions) is a minimal assignment 
falsifying the current clause. The variable queried is the variable resolved upon to derive 
the current clause. This strategy must result in a win for the Prover when the path reaches 
a clause in E. □ 



3. Characterization of regular resolution width 

In the present section, we modify the result of Atserias and Dalmau to characterize the 
width of regular resolution refutations. The characterization is again in terms of a two 
player game, that we shall call the regular k-width game. The game is exactly the same as 
that described in the previous section, but with the added condition that the Prover can 
never query a previously queried variable. 

As in the case of general resolution width, we can characterize the regular resolution 
width in terms of extendible families of assignments. However, we need to redefine the 
notion of an assignment. In the earlier notion of assignment, a variable could be in three 
states: positive (1), negative (0), and unassigned (*). For the case of regular resolution, 
we define an extended assignment to be an assignment of values in which each variable can 
be in four states: positive (1), negative (0), unassigned (*), or forgotten (Kl). The empty 
extended assignment to a set V of variables consists of the assignment in which all variables 
in V are unassigned (*) (this should be distinguished from assignments in which all of the 
variables are unassigned or forgotten (K)). 

If a is an extended assignment, then those variables that are assigned the values or 1 
are the live variables in a, and we write |a| for the number of live variables in a. If a and 
j3 are extended assignments to a set of variables V, then we write a C /3 if /3 results from a 
by replacing some unassigned variables by live variables. We also write a C [5 if a results 
from (3 by forgetting some variables, that is, changing the value of a live (0 or 1) variable 
to M. 

As in the case of the earlier fc-width game, the players are given a set of clauses E, on a 
set V of variables, and an integer parameter k > 0. Together, they construct a succession of 
extended assignments to the variables in V . Initially, the assignment is empty. Each round 
of the game proceeds as follows. First, the Prover queries an unassigned variable, and the 
Adversary assigns a value to it. Next, the Prover is allowed to forget some of the variables 
in the resulting assignment, that is, to change the value of a live variable from or 1 to Kl 
(forgotten); the result is the new current assignment. 

Again, the Adversary can win in two ways. First, if the current assignment assigns 
values to more than k variables; second, if the Prover has not won up to this point, but 
there are no unqueried variables, so the Prover has no legal move. The Prover wins if 
the current assignment falsifies an initial clause in E (if this clause contains more than k 
variables, then we count this as a win for the Adversary). As before, every play of the game 
must eventually terminate with a win for the Prover or for the Adversary. 
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Definition 3.1. If £ is a set of clauses on a set V of variables, then a family A of extended 
^-assignments is a regular extendible k-family for £ if it satisfies the following conditions: 

(1) The empty assignment belongs to A; 

(2) No assignment in A falsifies a clause in £; 

(3) Each assignment a in A satisfies the condition \a\ < k; 

(4) If a G A, and /3 C a, then /3 G .4; 

(5) If a G A, \a\ < k, x G V, and a(x) = *, then there is a /? G .A, so that a C f3, and 

= or p{x) = 1. 

The next theorem is the analogue of Theorem 12.21 for regular resolution. 

Theorem 3.2. Let £ be a set of clauses, and k > u)(£). Then the following are equivalent: 

(1) There is no regular resolution refutation of £ of width k; 

(2) There is a regular extendible k + 1-family for £; 

(3) The Adversary wins the regular k + 1-width game based on £. 

Proof. ( 1 => 2): Let us suppose that there is no regular resolution refutation of £ of width 
k. Define C to be the set of all clauses having a regular resolution proof from £ of width at 
most k; since u>(£) < k, £ C C. Let A be the set of all extended assignments of size at most 
k + 1 that do not falsify any clause in C. We claim that A is an extendible k + 1-family for 
S. 

Since the empty clause A does not belong to C, the empty assignment is in A, so the 
first condition in Definition 13. II is satisfied. The second condition holds because £ C C, and 
the third condition by definition. The fourth condition also follows from the definition of 
A. 

It remains to prove the fifth condition. Assume that a G A, and |a| < k, x G V, and 
a(x) = *, but there is no extension (3 5 ct in A with (3(x) defined. Let a and a 1 be the 
extended assignments obtained from a by setting x to and 1, respectively. Since neither 
a nor a 1 belong to A, it follows that there are regular resolution derivations 1Z° and 1Z 1 
of clauses Co and C\, each having width at most k, so that for i = 0,1, a l {Ci) = 0. Since 
a G A, it follows that Co = DM x, and C\ = E V —>x, for some clauses D and E. However, if 
we extend the regular resolution derivations 7Z° and 7Z 1 by resolving on i, so that the final 
clause is DV E, the result is a regular resolution derivation of D V E, where a(D V E) = 0. 
Since \a\ < k, D V E has at width at most k, showing that D V E is in C; this contradicts 
our assumption that a G A. 

( 2 =£- 3 ): Second, let us suppose that there is a regular extendible k + 1-family for £. 
Then the Adversary can play the /c-width game on £ by responding to the Prover's queries 
with the appropriate assignment from the family, starting with the empty assignment. Since 
no assignment in the family falsifies an initial clause, this strategy must eventually end in 
a win for the Adversary, no matter how the Prover plays. 

( 3 1 ): Finally, let us suppose that there is a regular resolution refutation of £ of 
width k. Then the refutation provides the Prover with a winning strategy in the regular 
k + 1-width game based on £. Starting from the empty clause at the root, the Prover follows 
a path in the refutation so that at the end of each round, after the Prover has forgotten 
certain live variables, the remaining live variables are the domain of a minimal assignment 
falsifying the current clause. The variable queried is the variable resolved upon to derive 
the current clause. This strategy must result in a win for the Prover. □ 
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Corollary 3.3. The question "Is there a regular resolution refutation of the set of clauses 
S with width k?" is in PS PACE. 

Proof. Theorem 13.21 shows that this question can be answered by an alternating Turing 
machine operating in polynomial time. □ 

In the case of general resolution width, it is not clear whether the corresponding problem 
is in PSPACE, because there is no polynomial upper bound on how long the /c-width game 
might last. 

4. Size and width of regular resolution proofs 

4.1. The width and size of resolution proofs. Recent results on size separation between 
regular and general resolution also show a width separation. 

Theorem 4.1. For each n > 0, there is a contradictory set of clauses with 0(n 2 ) vari- 
ables and 0(n 3 ) clauses for which the general resolution width is bounded, but the regular 
resolution width is Q(n). 

Proof. The paper pQ implicitly contains such a separation. More specifically the family of 
clauses GT' n defined in §3 of pQ fulfil the conditions of the theorem. The D,(n) lower bound 
on regular resolution width is proved (implicitly) in Theorem 3.10 of that paper, which 
shows an exponential (2 n / 200 ) lower bound on the size of regular resolution refutations of 

gt; p . □ 

The author's paper [13] demonstrates an improved size separation between regular 
and general resolution; it also shows a width separation between the two forms of proof 
system. The main theorem shows that for infinitely many n, there is a set Tl n of 0(relog 5 n) 
clauses containing 0(n log log n) variables, where the maximum width of a clause in II n 
and the general resolution width are both O (log log n), while the regular resolution width 
is U(n/ logn). 

The results just described suggest a natural conjecture that a good lower bound on 
the regular width of a set of clauses leads to a good lower bound on the size of a regular 
refutation of them. For both general and tree resolution, Ben-Sasson and Wigderson [5] 
have proved strong results along these lines. 

Theorem 4.2. [Ben-Sasson and Wigderson 2001] Let S be a contradictory set of 
clauses with an underlying set of variables V . Then: 

(1) S T {T) > 2«-(^o)-»( s ); 

(2) S(£)=exp(0( ^ Sh y s » 2 )) 

Given the width and size separation results between regular and general resolution cited 
above, it seems reasonable to conjecture that the second lower bound proved by Ben-Sasson 
and Wigderson might hold, in the form where we replace "resolution size" by "regular 
resolution size," and "resolution width" by "regular resolution width." In the remainder of 
the paper, we show that this conjecture fails. 

Before proceeding to the main constructions, it may be helpful to the reader to clarify 
the relations between the various forms of resolution discussed here. 
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If we consider the size measure alone, then it is not hard to see that regular resolution 
is at least as powerful as tree resolution. This is because a pruning procedure [IT] [12j p. 
436] can be applied to a tree refutation to remove any irregularities while decreasing the 
size of the tree. On the other hand, the lower bound on width for regular refutations proved 
below does not apply to tree resolution, since the minimum width of a tree-style refutation 
of a set of clauses is the same as that of a general resolution refutation (we can convert any 
general resolution proof into a tree-style proof by repeating subderivations). 

However, if we insist on restricting our attention to tree-style refutation of minimum 
size, then the lower bounds on width do apply, since such refutations are necessarily regular. 

5. Pebbling games and pebbling formulas 

5.1. The pebbling game. A pointed graph G is a directed acyclic graph where all vertices 
have indegree at most two, having a unique sink, or target vertex, to which there is a 
directed path from all the vertices in G. It is binary if all vertices except for the source 
vertices have indegree two. If v is a vertex in a pointed graph G, then G\ v is the subgraph 
of G restricted to the vertices from which there is a directed path to v. 

The pebbling game played on a pointed graph G is a one-player game in the course of 
which pebbles are placed on or removed from vertices in G. The rules of the game are as 
follows; 

(1) A pebble may be placed on a source vertex at any time. 

(2) If all predecessors of a vertex are marked with pebbles, then a pebble may be placed 
on the vertex itself. 

(3) A pebble may be removed from a vertex at any time. 

A move in the game consists of placing or removing one of the pebbles in accordance with 
one of the three rules. The configuration at a given stage in the game is the set of vertices 
in G that are marked with a pebble. A play of the game begins with no pebbles on G. The 
goal of the game is to place a pebble on the sink vertex t, while minimizing the number of 
pebbles used (that is, minimizing the number of pebbles on the graph at any stage of the 
game). Thus a successful play of the game can be presented as a sequence of configurations 
Co, . . . , Ck, where Co = and t € Ck, where Cj + \ is obtained from Cj by one of the three 
rules. 

A strategy for the game is a sequence of moves following the rules of the game that 
ends in pebbling the target vertex. The cost of such a strategy is the minimum number of 
pebbles required in order to execute it, that is to say, the size of the largest configuration 
in the sequence of configurations produced by following the strategy. The pebbling number 
of G, written as j}G, is the minimum cost of a strategy for the pebbling game played on G. 

5.2. Pebbling formulas. We associate a contradictory set of clauses Peb(G) with every 
pointed graph G. Each vertex in G except the target t is assigned a distinct variable; to 
simplify notation, we identify a vertex with the variable associated with it, and use the 
notation Var(G) for the set of these variables. We associate the constant _L (falsum) with 
the target vertex t, and make the identification t = _L 

Definition 5.1. If G is a pointed graph, Peb(G) is a set of clauses expressed in terms of 
the variables Var(G), so that Peb(G) = {Clause(w) : v G G}. 
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(1) If v is a source vertex of G, then Clause(u) = v. 

(2) If v is a vertex in G, with predecessor u, then Clause(u) = u — > v. 

(3) If v is a vertex in G, with predecessors u, w, then Clause(u) = u,w ^ v. 

If we set some variables in Peb(G), then the resulting set of clauses is not necessarily 
of the form Peb(G"), where G' is a subgraph of G. We shall focus on a family of special 
assignments, called pebbling assignments, that preserve this property. If v G G, v ^ t, then 
we define the assignment := 1] to be the assignment defined by first setting the variable 
v to 1, and then setting to 1 any variable u for which there is no implicational chain from u 
to _L in the resulting clause set. The assignment \v := 0] is defined as follows: first, choose 
a directed path n = (v, . . . , t) from v to the target t, set all the vertices in the path to 0, and 
in addition set any vertex from which v is not reachable, but not in the path ir, to 1. The 
assignment {v := 0] is not uniquely determined by this construction, since it depends on 
the path chosen - however, this is not important, since the set of clauses Peb(G)f {v := 0] 
resulting from the restriction is independent of the path. A pebbling assignment results 
from a sequence of restrictions of the form ft; := 0] and {w := 1]. 

The effect of the restrictions just defined can be described directly as an operation on 
the underlying graph. If G is a pointed graph, and v G G, v ^ t, G[v := 1] is the graph 
resulting from G by first removing v, together with all edges entering or leaving v, and then 
restricting the resulting graph to the vertices from which the target vertex t is accessible. 
G[v := 0] is the pointed graph G\v. 

Lemma 5.2. 

(1) Forb = 0,1, Peb{G)\\v := bj = Peb{G[v := b}). 

(2) If G is a pointed graph, and v G G, then 

tfG < max^G^ := 0] ,%G[v := 1] + 1}. 

Proof. The first part of the lemma follows straightforwardly from the definitions. For the 
second part, we employ the following strategy in the pebble game on G; the strategy is the 
same as the one used in Lemma 15 of [1]. 

First, follow a minimum cost strategy to pebble v in G[v := 0]. Second, leaving a pebble 
on v, but removing all other pebbles, follow a minimum cost strategy in the pebbling game 
on G[v := 1] to pebble the target vertex in G, using the extra pebble for any moves 
where a pebble is needed on v to justify a placement. The cost of this strategy is at most 
max{$G[v:=Q],tG[v:=l] + l}. □ 

Example 5.3. If G is the pyramid graph shown in Figure 1, then Peb(G) is the set of 
clauses 

{a,b,c, d, (a,b e), (6,c-> /), {c,d^ g), (e,f h), (f,g ->• i), (h,i ->• _L)}. 

The assignment [e := 1] sets the variables a and e to 1; Peb(G) |" [e := 1] is Peb(G[e := 1]) 
where G[e := 1] results from G by removing the vertices a and e. If we choose the path 
/ — > h — > _L, then the assignment [/ := 0] sets the variables / and h to 0, while the variables 
a,d,e,g,i are all set to 1. The set of clauses Peb(G)t[/ := 0] is Peb(G[/ := 0]), where 
G[f := 0] is the subgraph of G containing only b, c and /. 

Lemma 5.4. If G is a pointed graph with n vertices, then Peb(G) has a tree resolution 
refutation with size 2n — 1. 
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Figure 1: A pyramid graph 



Proof. Starting with the clause Clause(_L) associated with the sink of G, construct a se- 
quence of purely negative clauses, working from the sink to the sources, by successive 
inferences using input resolution. Let C V —>w be the last clause in the sequence constructed 
so far, where w is not a source vertex, and C is purely negative. Resolve C V -iw against the 
clause Clause(u;) = (u, v — > w) to produce the next purely negative clause in the sequence, 
C V —>u V —>v. The sequence must end in a purely negative clause in which all the literals are 
of the form -is, where s is a source vertex. Now resolve each of these negative literals against 
the one-literal positive clauses corresponding to the sources. In this input refutation, each 
variable is resolved upon exactly once, so that the refutation has size In — 1. □ 

If £ is a set of clauses, then a C '-critical assignment is a total assignment to the variables 
in S that makes all the clauses true, except C. In the case of Peb(G), we are interested 
in a particular family of critical assignments. Let v be a vertex in G, and tt = (v, . . . , t) a 
directed path in G from v to the target vertex t. Set all the vertices in the path tt to 0, 
and all other vertices in G to 1. This assignment makes all of the clauses in Peb(G) true, 
except for Clause(f). An assignment determined by the path tt we shall call a v-critical 
assignment, since the clause that it falsifies is associated with the vertex v. Since we have 
assumed that G is a pointed graph, such v-critical assignments exist for all the vertices v 
in G, so that Peb(G) is minimally inconsistent. 

Lemma 5.5. If G is a pointed graph with jJG = p, then there are at least p vertices v in G 
for which there is a v-critical assignment for Peb(G). 

Proof. Every pebbling strategy for G must contain a configuration with p pebbles, so there 
must be at least p vertices in G. For every vertex in G, we can construct a ^-critical 
assignment for Peb(G) by choosing a path from v to the target vertex. □ 
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6. Constructing clause sets with large regular width 

6.1. The basic construction. To produce clause sets requiring large regular width, we 
start from the set of clauses Peb(G), where G is a pointed graph with n vertices. We use 
the abbreviation V for the set of variables Var(G), and V p for the set of all sequences of 
variables in V of length p. 

Let a be a function from G to V p , that is to say, a function associating a sequence 
of length p with every v € G. Thus, for each v G G, we have an associated sequence 
a(v) = <ti(v), . . . , <jp(v ), where each cjj{v) is a variable in V; the sequence may contain 
repetitions. Now for v G G, define the set Clauses' 7 (v) to be the set of all clauses having 
the form 

Clause(t;) V ±ai(v) V • • • V ±<r p (v), 
where =Lr, for r G V, is either r or —ir. Clauses' 7 (v) contains 2 P clauses of width at most 
p + 3. In addition, for A C G, define 

Clauses' 7 ^) = |J{ Clauses' 7 ^)]?; G A}. 

The construction just described can be considered as an iteration of the method used to 
construct the family of clauses GT' n defined in §3 of pQ. A key difference from the earlier 
construction is that the GT' n p examples begin from a set of clauses GT' n that is hard for 
tree resolution (though easy for general resolution), while the present construction begins 
from a set of clauses Peb(G) that is easy for tree resolution. 

The clause sets that we construct in this section are of the form Clauses' 7 (G), for G a 
pointed graph with n vertices. To ensure that these clause sets require large regular width, 
the map a must satisfy a combinatorial condition that can be stated roughly as follows: 
the image of any large set of vertices in G has a large intersection with any large set of 
variables. In the next subsection, we give a precise meaning to the term "large," and prove 
the existence of a function a satisfying the condition, by a probabilistic construction. 

For G a pointed graph, and a a function from G to V p , define Peb' 7 (G) to be Clauses' 7 (G). 
Peb CT (G) contains n ■ 2 P clauses of width at most p + 3. We shall show in what follows that 
the sets of clauses Peb CT (G), for an appropriate family of pointed graphs G and functions a, 
require large regular width, but on the other hand have regular tree resolution refutations 
whose size is linear in |Peb' 7 (G)|. 

6.2. A combinatorial lemma. In this subsection, we formulate and prove the existence 
result described above, by employing a probabilistic construction. If a G X k , and B C X, 
then we use the notation a D B for the set of all elements in the sequence a that also 
belong to the set B; similarly, if S is a set of such sequences, then S D B is defined to be 
[j{a n B\a G S}. For AQG, define a (A) = {a(v)\v G A}. 

Lemma 6.1. Let G be a pointed graph with n vertices, V = Var(G) the set of variables in 
Peb(G), and p = [log 5 n] . 

For any d > 0, and sufficiently large n, there is a map a from G to V p satisfying the 
condition: For all A C G and B C V with \ A\ = \B\ = [dn/logn\, \a(A)r\B\ > dn/2\ogn. 

Proof. Let us associate with each v G G a random subset of V with size p, chosen with re- 
placement. That is to say, with each v G G, we associate a sequence o~(v) = o~i(v), . . . , o~ p (v), 
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where each variable o~j (v) is chosen independently and uniformly at random from the set V 
of all variables. 

In the first part of the proof, let us consider the sets A and B to be fixed subsets of G and 
V respectively. Define a map a from G to V p to be bad for A and B if \a(A)nB\ > dn/2logn; 
otherwise good for A and B. We begin by proving that for fixed sets A and B, a random 
map a is bad with exponentially small probability. 

To prove this bound on the probability, it is convenient to consider the construction of 
the map as resulting from a series of independent choices. Divide the sequence l,...,p into 
q = [log 3 n\ blocks, so that each block contains at least 0(log 2 n) integers. That is to say, 
the sequence l,...,p can be written as a concatenation T\Ti, ... ,T q of sequences Tj, each of 
length at least 0(log 2 ra). 

Fix a block Tj, where 1 < j < q, and define a random variable Z representing the 
number of variables in B that are not in the random subset Tj(A), that is to say 

Z( Tj ) = \{xeB\x?T j (A)}\. 

We begin by estimating the expected value of Z. 

Let B = {b±, &2> • • • ; bi, . . . , b m } where m = [dn/ 'log raj. Define an indicator random 
variable 0, by: 



1, if bi^Tj (A) 

0, if bi G Tj(A), 

so that Z = @i H 1- m . We estimate the expected value of 0j by 

1 \W 
E(Qi) < (l- — 



< 1 



I ^ 0(n log n) 



n — 1 
< exp (— (log n)) , 

showing that 

E{Z) < m ■ exp (-Q (log n)) = m ■ o(l). 

It follows that for any given positive 7, E{Z) < jm, for sufficiently large ra. For the 
remainder of the proof, we assume that ra is chosen sufficiently large so that E(Z) < m/8. 

In the second stage of the proof, we need to show that the random variable Z is 
tightly concentrated around its mean. To do this, we employ a large deviation bound for 
martingales, following [Tj. 

Order the set A as {ai, . . . , a m }. The sequence 7^(01), Tj(a<2), . . . , Tj(a m ) represents a 
random subset of variables with size r = m ■ \tj\ = 0(nlogn). Let 1Z be the set of all 
sequences in V of length r. For a € 1Z, and 1 < t < r, define a\t to be the subsequence 
<Ti, . . . , at- Define an equivalence relation on 1Z by setting, for p, a G 1Z, 

p= t a <==^ p\t = a\t, 

for 1 < t < r, and let =0 be the universal relation on 1Z. Let Tt be the finite Boolean 
algebra whose atoms are the blocks of the partition of TZ induced by =j, for < t < r; the 
sequence To, . . . , T r of Boolean algebras forms a filtration over the set 7£. 

Define a sequence of random variables Zq,...,Z t by setting Zt = E(Z\Tt). Then 
Zq = E(Z), Z r = Z, and the sequence Zq, . . . , Z r forms a martingale [8j p. 221], the Doob 
martingale associated with the filtration J-q, ... , J>. The intuitive picture here is that at 
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time 0, we begin with no specific information about a given sequence a; we learn its values 
one by one at each successive time step t, until we have full information about a at time r. 

If p and a are two sequences in 1Z that differ at most at a single point, then \Z(p) — 
Z(cr)\ < 1. In the terminology of Alon and Spencer [21 p. 89], the random variable Z 
satisfies the Lipschitz condition relative to the filtration J^q, . . . , J- r . It follows by Theorem 
4.1 of Chapter 7 of the monograph by Alon and Spencer [2 S p. 90] that \Zt+i — Zt\ < 1. 
Consequently, by the martingale tail inequality of Hoeffding and Azuma [8, p. 221] [2[ p. 85], 

P(Z>m/2) < P(Z-E(Z) > 3m/8) 

< exp(-(3m/8) 2 /2r) 

< exp(— f2(n/log 3 n)). 

Let W be the random variable representing the number of variables in B not in the 
image of A under a: 

W{a) = \{x £ B\x &<t(A)}\. 
Since the maps Ti,...,r q are constructed independently, it follows that 

P(W > m/2) < [exp(-ft(n/log 3 n))]« = exp(-0(n)). 

We can now complete the proof of the existence of a map a satisfying the condition of 
Lemma [6. 11 The probability that a random map p € TZ is bad for some A and B is bounded 
by 



\ 2 

exp(— Q(n)). 

ml 



Using the simple inequality 



< 



en\ 



K kJ ~ \ k J ' 

found in Bollobas's textbook on graph theory [6j p. 216], the binomial coefficient above can 
be bounded by 



n \ /eny 



mj V to / 

. ,\0(n./logn) 
e O(loglogn) * 



gO(n log log n I log n) 

Hence, the probability can be bounded above by 

exp(0(nlog logn/ logn)) exp(— ft(n)) = exp(— Q(n)). 

Consequently, the probability that a random map p is bad for some A and B is exponentially 
small for sufficiently large n, showing that a map satisfying the condition of the lemma must 
exist. □ 
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7. Separating regular size and width 

Let G be a pointed graph, V the set of vertices in G (other than the sink) and N and p 
positive integers. We define a map a to be good for G, N, and p if it satisfies the condition: 
There is a map a from G into V p so that for any A C G and B C V, if \A\ = \B\ = [N\ , 
then |cr(v4) Pi B\ > N/2. Lemma 16.11 states that for any d > 0, given sufficiently large n, 
p = [~log 5 n], and N = dn/logn, for every pointed graph with n vertices, there is a map a 
that is good for G, N, and p. 

This lemma allows to construct a set of examples that have polynomial-size regular 
resolution refutations, but large regular width. The construction is based on the following 
result of Paul, Tarjan and Celoni. 

Theorem 7.1. [9] There is a sequence of binary pointed graphs Gi, . . . , Gi, . . . with pebbling 
number at least cn{i)/ log n{i), for sufficiently large i, where n{i) = \Gi\ = 0{i2 l ), and 
c > 1/20. 

It should be mentioned that the graphs Hi constructed by Paul, Tarjan and Celoni, 
though binary, are not pointed, since they are constructed to have multiple sink nodes. 
However, in their main theorem, they show that for sufficiently large i, their graph Hi 
contains a sink node that requires cn{i)/ log n(i) pebbles to pebble it, starting from the 
empty configuration. Hence, we can construct a pointed graph from Hi by choosing such a 
sink node, and considering the subgraph Gi containing all the nodes from which this sink is 
accessible. This subgraph Gi still satisfies the condition n(i) = \Gi\ = 0(i2 l ), so the main 
theorem of Paul, Tarjan and Celoni continues to hold, if we add the qualifier "pointed." 

Lemma 7.2. Let G be a pointed graph with n vertices, and a a map from G to V p , where 
p = [log 5 n] . Then the set of clauses Peb a (G) contains n—1 variables andn°( log n > clauses, 
and has a regular tree refutation with size n°' log n \ 

Proof. By Lemma \5A\ Peb(G) has an input refutation with size at most 2n — 1. For a given 
vertex v in G, the clause C(v) associated with the vertex can be derived from Clauses CT (?;) 
by a tree resolution proof with size 2°( log5n ) = n°' log n \ Consequently, Peb CT (Gj) has a 
tree refutation with size O(n) -n°( log n \ that is, n°( log n \ This tree refutation may not be 
regular; however, if irregularities are present, it is possible to remove them [11] |12l p. 436] 
resulting in a smaller regular tree-style refutation. □ 

Lemma 7.3. Let G be a pointed graph with pebbling number jJG = N and a a map that is 
good for G, N/2, and p. Then any regular resolution refutation of PeW (G) must contain a 
clause with width at least N/4. 

Proof. We prove the Lemma by showing that the Adversary wins the regular iV/4-width 
game based on Peb CT (G). The winning strategy has two stages. In the first stage, the 
Adversary maintains a pebbling assignment to G; at the start of the game, this assignment is 
empty. In the second stage, the Adversary answers according to a fixed v -critical assignment. 

Assume that it is the Adversary's turn, that it is the current pebbling assignment to 
the variables V in Peb CT (G), and that x is the variable currently queried by the Prover. The 
Adversary answers the current query according to these rules. 

(1) If the variable x is already assigned a value by it, then answer the query according to 
vr; 
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(2) If the variable queried is not assigned a value by w, then it must be associated with 
a node v G G\ir. Extend tt to a pebbling assignment 7r' so that tt' contains \v := 6], 
choosing b so as to maximize the pebbling number of G\tt'. 

The Adversary continues to play according to these rules until |_7V/ 2j nodes in G have been 
queried; when this happens, the first stage is completed. 

With the first stage completed, let a be the current extended assignment, tt the current 
pebbling assignment maintained by the Adversary; we assume that it is the Prover's turn. 
By Lemma 15.51 there are at least N/2 vertices v G G\tt for which there is a v -critical 
assignment for Peb(Gf7r). If is such a critical assignment, then tt U eft is a f-critical 
assignment for Peb(G). Let A be the set of all nodes in G satisfying this condition, and B 
the set of variables queried in the game so far. Because \A\, \B\ > \_N/2\ , \a(A)DB\ > N/4, 
since a is good for G, N, and p. 

Since the Prover and Adversary are playing the regular N/4- width game, it follows 
that \a\ < N/4 (since the current assignment after the Adversary's reply has width \a\ + 1). 
Hence, at least one variable v in <j{A) n B must be forgotten in a. Let <j> be a f-critical 
assignment for Peb(G); eft is also a u-critical assignment for Peb <J (G). In the second stage of 
the strategy, the Adversary answers all queries in accordance with the assignment 0. Since 
</> makes all of the clauses in Peb CT (G) true, except for a clause in Clauses(w) containing the 
variable v, this strategy results in a win for the Adversary, since the variable v is forgotten, 
so the Prover cannot query it again. □ 

Theorem 7.4. There is an infinite sequence £i, E2, • • • , Ej, • • • of contradictory sets of 
clauses and a corresponding list of parameters n(l),n(2), . . . , n(i), ... so that (abbreviating 
n(i) as n ): 

(1) Each clause set Sj contains n — 1 variables and n°' log ") clauses with width 0(log 5 n); 

(2) Ej has a regular tree refutation with size n°' log ™); 

(3) Any regular refutation of Ej must contain a clause with width f2(n/logn). 

Proof. Define Ej = Peb <J (Gj), where Gi is one of the sequence of pointed graphs in Theorem 
17.11 The theorem follows by Lemmas 16. 1\ 17.21 and 17.31 □ 

Although the clause sets in Theorem 17.41 have size quasi-polynomial in n, they have 
regular tree refutations that are linear in the size of the clause sets themselves. Furthermore, 
if we compute the significant quantities in the second part of Theorem \4.2\ we find that 
if the corresponding theorem held for regular size and width, then regular refutations of 
these clause sets would have to have size exponential in n/log 2 n. This shows that the 
relations between size and width holding for tree resolution and general resolution cannot 
be generalized to the case of regular resolution. 
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